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Abstract — Considering the interaction through mutual 
interference of the different radio devices, the channel 
selection (CS) problem in decentralized parallel multiple 
access channels can be modeled by strategic-form games. 
Here, we show that the CS problem is a potential game 
(PG) and thus the fictitious play (FP) converges to a Nash 
equilibrium (NE) either in pure or mixed strategies. Using 
a 2-player 2 channel game, it is shown that convergence 
in mixed strategies might lead to cycles of action profiles 
which lead to individual spectral efficiencies (SE) which 
are worse than the SE at the worst NE in mixed and pure 
strategies. Finally, exploiting the fact that the CS problem 
is a PG and an aggregation game, we present a method 
to implement FP with local information and minimum 
feedback. 

I. Introduction 

In recent literature in wireless communications (see 
ifTTl , El, (3l), it has been shown that channel selection 
(CS) problems in decentralized parallel multiple access 
channels (MAC), where each transmitter is interested in 
its own spectral efficiency, can be modeled by strategic- 
form games. More importantly, it has been shown in (U 
that CS problems in MAC are finite potential games 
(PG) [4]. In general, finite potential games belong to 
the class of games for which the existence of at least 
one Nash equilibrium (NE) is ensured @). Moreover, it 
has been shown that the iterative best-response dynamics 
(BRD) and fictitious play (FP) both converge to Nash 
equilibrium in potential games @. The BRD in CS 
problems has been studied in (3J- I n this paper, we 
exclusively focus on the case of FP in its original version 
l6l . Here, it is shown that when several NE exists in the 
CS game, which is often the case at high signal-to-noise 
ratio, FP might converge to (strictly) mixed strategies 
and cycles of action profiles might be observed. This fact 
rises the main question to be answered by this paper: is 
the empirical measure of the frequency of each player's 
actions a good metric to evaluate convergence? Here, we 
use a 2— player 2— action game to show that the actions 
or mixed strategies corresponding to an NE are never 
played even though a convergence of the empirical fre- 
quencies of each player is observed. Surprinsingly, cycles 
of action profiles might lead players to achieve expected 
utilities which are worse than the worst expected utilities 
at NE in pure and mixed strategies. 
Finally, we also show that the CS problem is an ag- 
gregation game [7J . Here, the utility achieved by a 
given player does not depend directly on the actions of 
all the players but on the actions of the given player 
and a linear combination of all the players' actions. 



Interestingly, such a linear combination of actions in 
the CS problem in MAC, is just the multiple access 
interference (MAI) seen at the receiver. Thus, the MAI 
observed over each channel at the receiver can be fed 
back to the transmitters through a common signaling 
message. Using this message, it is shown that the FP can 
be implemented by relying on the fact that transmitters 
are able to obtain an estimate of their own channels and 
to calculate their utilities based on the above mentioned 
signaling message. 

This paper is organized as follows. In Sec. [TTJ we 
formalize the CS problem in parallel MAC and formulate 
the corresponding strategic-form game. In Sec. [TTlJ we 
present recent results on the existence and multiplicity of 
the NE in the CS game. In Sec. [TV] we introduce the FP 
originally introduced in (6] using our notation. Therein, 
it is shown that the CS game possesses the fictitious play 
property and thus, FP converges to NE in the CS game. 
In Sec. |VJ we study the convergence of the FP in a 2- 
player 2-channel game and describe a two-action-profile 
cycle. In Sec.|VTl we exploit the fact that the CS problem 
is an aggregation game to provide a practical way of 
implementing FP with milder information assumptions 
than its orig inal version. Finally, the paper is concluded 
by Sec. |Vn| 

II. Models 

A. System Model 

The channel selection (CS) problem in the parallel- 
MAC can be described as follows. Assume that there 
exist a set K. = {1, . . . , K} of transmitters communicat- 
ing trough a common set S = {1, . . . , S} of orthogonal 
channels with a unique receiver. Channel s G S has 
a bandwidth of B s Hertz and B = Yls=i Bs Hertz. 
Each transmitter communicates with the receiver using 
a unique channel. Limiting transmitters to use a unique 
channel in decentralized networks is optimal for the 
global spectral efficiency of the network (see JSJ, |2), 
|3l). Let t G IN be a discrete time index. Denote by 
Pkb) = (Pk,i{t),...,p k ,s(t)) the PA vector of trans- 
mitter k G fC at time t > 0. Here, Pk, s (t) represents 
the transmit power of transmitter k G rC over channel 
s G S at time t > 0. The set of available PA vectors 
for transmitter k are {pk,m ax e n ) neS , where p k , max is the 
maximum transmit power of transmitter k and e n is the 
ra-th vector of the set of unitary vectors of the canonical 
base of H s . Here, Vs G S, e s = (e S) i, . . . , e Sj s), 
and Vn G S \ {s}, e Sin = and e s>s = 1. For all 
(k, s) G K x S, gk, s represents the channel gain between 



transmitter k and the receiver through channel s. The 
received signal at the receiver at time t, denoted by the 

vector y(t) = (yi(t), ys(t)) T , where Vs G S, is 

K 

Vs(t) = ^2g k , s x k ,s(t) +w s . (1) 
fc=i 

Here, the symbols transmitted by transmitter k at time t 
are denoted by x k (t) = (x k i(t), . . . ,x ki s(t)) T - F° r ai l 
(k,s) G K, x S at each time t, x k>s (t) is a realization 
of a Gaussian random variable with zero mean and 
variance Pk, s (t), i.e., E[x ktS (t)x k>s (t)*] = p k , s (t). The 
vector w = (w±, . . . , ws) is a S'-dimensional additive 
white Gaussian noise process with zero mean and co- 
variance matrix diag (af, . . . , <t|). Hence, assuming that 
the receiver implements single user decoding (SUD), the 
spectral efficiency of transmitter k G K can be written 
as 

n4 P W)=E se ^io g2 (i + ^ +E3 ^ s(t)8js ). (2) 

At each time t, the aim of each transmitter is to choose 
the channel which maximizes its own spectral efficiency 
regardless of the spectral efficiency of its corresponding 
counterparts. 

B. Game Theoretic Model 

Assume that the strategic-form game 
Q = (/C, {A k } k€K , {u k } keJC ) models the CS problem 
in parallel-M AC. Here, the set K of players is the set of 
transmitters, and the set of action profiles is, VA; G K, 

A k = {f»fc,max ei, . . . ,Pfc,max e S } . (3) 

We denote by pi tne PA vector p kmax e n , V(/c, n) G 
K x S. An action profile of Q is a super vector p = 

(pi, . . . ,p K ) G A, where A = A\ x . . . x Ar:. The 
utility function : ^4 — >• R measures the benefit that 
player k obtains when it plays a specific action given the 
actions adopted by all the other players. Here, the utility 
function u k is defined in ©, for all k G /C. 
We assume the game Q is dynamic in the sense that 
it is repeatedly played a large number of times. We 
denote by p k (t) G A k , the PA chosen by player k at 
time t. For the ease of notation, we denote u k (t) = 
u k (p fc (i),p_ fc (t)). At each time t, each player k G K, 
chooses its action p k (t) following a probability distri- 
bution 7r fe = k^),."!^) ^ A (Ak), where 
V(fe, s) G K x 5, 7r fc (») represents the probability that 
player k uses channel s, i.e., for a given time t > 0, 

7r rf= Pr (^( i )=4 s) )- w 

In this paper, we refer to n k as the mixed strategy 
of player k G K. Our interest is to find the set of 
strategy profiles TT k G A (A k ), Vk G K,, such that 
once played, every player obtains its maximum bene- 
fit (spectral efficiency) given the strategies of all the 



other players. Under these conditions, no player would 
be interested on changing its strategy since it would 
represent a decrement of its own utility. Strategy profiles 
satisfying this condition are known as Nash Equilibrium. 
In general, an NE is defined as follows: 

Definition 1 (Nash Equilibrium): A mixed-strategy 
profile 7V* is an NE if, for all players k G /C and Vs G S 

u k (Tr* k ,TT*_ k ) > u k (e s ,ir*_ k ), (5) 
where, u k : A (Ai) x . . . x A (Ak) -> R 

u k (tt) = E w [u k (p k ,P-k)] ■ ( 6 ) 
III. Existing Results 

In this section, we introduce some definitions and 
existing results required in later sections. First, we in- 
troduce the concept of potential games (PG). Second, 
we provide elements on the existence and multiplicity 
of the NE. We define a PG as follows: 

Definition 2 (Exact Potential Game): Any 
game in strategic form defined by the 3-tuple 
(JC, (A k ) ke ic , ( u fc)fcg/c) ™ an exact potential game 
if there exists a function <j) (p) for all p G A such that 
for all players k G /C and for all p' k G A k , it holds that 

MPk,p-k)-uk(p'kiP~k) = <t>(p k ,p- k )-<t>(p' k ,p-k)- 

In lO, 01, it is proven that the CS problem in parallel- 
MAC is a finite PG. Hence, the existence of at least one 
NE is guaranteed [4]. Moreover, it is proven in @, that 
several NE might simultaneously exist and can be easily 
identified. We summarize those results in the following 
proposition. 

Proposition 3 (The Channel Selection game is a PG): 
The channel selection game Q = 
(1C, {Ak\k&K ' { u k}ke!c) ' s a potential game with 
potential function <j> : A — > R 



4>(P) = ~£ l0g 2 ( ff s + 2 Pk,s9k,s ) • (7) 
s£S V k=l / 

Denote by L G IN the number of pure NE. Hence, 1 ^ 

L ^ S K -\ 



IV. Channel Selection Games and Fictitious 

Play 

In 0, it has been shown that the iterative best- 
response dynamics (BRD) converges to pure NE in 
the CS in parallel-MAC games. Conversely, the simul- 
taneous BRD does not necessarily converge in pure 
strategies. Here, we study another dynamic known as 
fictitious play (FP). 



A. Description of Fictitious Play 

The fictitious play can be described as follows. As- 
sume that transmitters have complete and perfect in- 
formation, i.e., they know the structure of the game Q 
and observe at each time t the PA vectors taken by 
all players. Each transmitter k G K, assumes that all 
its counterparts play independent and stationary (time- 
invariant) mixed strategies tvj, Vj G K \ {k}. Under 
these conditions, player k is able to build an empirical 
probability distribution over each set Aj, Vj G K, \ {k}. 

Let fk, Pk (t) = jEi=i 1 {p i ( s )= Pt } be the (empirical) 
probability with which players j G IC\{k} observe that 
player k plays action p k G At- Hence, VA; G K and 
Vp fc G Ak, the following recursive expression holds, 

h, Pk (t+i)=h, Pk W+^T (i {pfc(f)=Pfc} -/ fc , Pfc (*)) • (8) 

Let fk, P _ k (t) = J/j,p 3 (*) be the probability with 

which player k observes the action profile G A-k 
at time t > 0, for all k G K,. Let the \ A-k\ —dimensional 

vector f k (t) = (fk, P X A e A(A- k ) be 

the empirical probability distribution over the set A-k 
observed by player k. In the following, we refer to the 
vector f k (t) as the beliefs of player k over the strategies 
of all its corresponding counterparts. Hence, based on 
its own beliefs f k (t), each player k chooses its action 

at time t, Pk{t) = pi n , where nkif) satisfies that: 

n k (t) G argmaxu fc (e s ,f k (t)) , (9) 

s€S 

where, for all k G K,Uk is defined in ©. From (HJ, it can 
be implied that playing FP, players become myopic, i.e., 
they build beliefs on the strategies being used by all the 
other players, and at each time t > 0, players choose the 
action that maximizes the instantaneous expected utility. 
Hence, a natural question arises: are players always able 
to build their respective beliefs?, i.e., does the learning 
process © converges to a specific strategy profile? We 
tackle these questions in the following subsection. 

B. Convergence of the Fictitious Play 

The game Q = (/C, {Ak} keK > { u k}keic) is said to 
have the fictitious play property (FPP), if the following 
holds, for all k G K., and for all p k G Ak, 



limf k , Pk (t) = r ktPk , 



(10) 



and > fk, P _ k = II #jy yp -k G A - k > is a time " 
jeic\{k} 

invariant probability measure over the set A-k- When 
condition (fTOb holds for all players, it is said that the 
FP converges empirically to the probability distribution 



fi 



, for all k G K,. Now, from Def. 

'v P _ k eA- k 

[T] the mixed strategy profile tt = (7Ti, . . . , ttk), with 

77 k = (4V i) '"' , ^,p[ s) )' for a11 k G /c ' is an NE 

strategy profile. 



Many classes of games have been proved to have the FPP 
(see [5] and references therein). In particular, potential 
games have the FPP and so does the CS game. Hence, 
we write the following proposition 

Proposition 4 (Convergence of FP in CS): The ficti- 
tious play converges empirically to the set of Nash 
equilibrium in the CS game in parallel-MAC. 

In the following sections, we study the implications of 
convergence of the FP in the CS game and the required 
information assumptions. 

C. Practical Limitations of Fictitious Play 

As presented in its original version |6j, the FP requires 
complete and perfect information. This is the same as 
stating that each transmitter, at each time t > 0, is aware 
of the number of active transmitters in the network, their 
set of actions, their utility function and moreover, it is 
able to observe the action played by each one of all 
the other transmitters. Clearly, this assumption is not 
practically appealing since it would require a massive 
signaling between transmitters, which reduces the spec- 
tral efficiency of the whole network. Additionally, as we 
shall see, in the high SNR regime, the CS problem has 
the same structure of a potential coordination game O. 
In this kind of games, the set of probability distributions 
f k , Vfe G /C, converges but not necessarily the actions, 
i.e., fictitious play might converge to a strictly mixed 
strategy profile. When FP converges to a mixed strategy, 
it is possible that players cycle around a subset of action 
profiles, which might lead to an expected utility which 
is worse that the worst expected utility at the NE in 
pure and mixed strategies. In the following section, we 
present a simple study case where it is easy to evidence 
this cycling effect. 

V. Study Case: A 2 x 2 Channel Selection 
Game 

Consider the game Q = {lC,{Ak} kel c>{ u k}kelc)> 
with K = 2 and 5 = 2. Assume also that V/e G fZ, 
Pfc,max = Pmax and Vs G S, o 2 s = a 2 and B s = |. 
Denote by SNR=£^yp the average signal to noise ratio 
(SNR) of each active communication. Note that since 
Q is a PG (and more importantly a Best-Response PG 
|[T0l ), the set of NE of Q is equivalent to the set of NE of 

the game Q' = (JC, {A k } k ^ K , {0} fc6 /c)- In the § ame G'> 
all players have the same interest (same utility function) 
and obtain the payoffs shown in Fig. [TJ 
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P 2 = (Pn, a x,0) 


P 2 = (0, Pma x) 


Pl = (Pmax,0) 


£ log 2 (<' 2 +Pmax(311+S2l)) 


5 1°S 2 (<T 2 +Pmax31l) 
+ ^ lo S 2 ( CT2 +PmaxS22) 


Pl = (0,?Wx) 


i log 2 (cr 2 +Pmax.912) 
+ ^ lo g2 (<T 2 +p max g2l) 


\ log 2 (CT 2 +p max (912 +922 )) 

+ ^l°s 2 (- 2 ) 



Fig. 1. Potential function </> of the game Q — 

(/C,{A} fceK ,{Mfc} fceK ), with K = 2 and S = 2. Player 1 
chooses rows and player 2 chooses columns. 



1) Nash Equilibria: We identify the NE in pure 
strategies of the game Q' (and thus Q) in the following 
proposition: 

Proposition 5 (Nash Equilibria in pure strategies): 
Let the PA vector p* = (p*,P2) 6 A be one NE in 
the game Q. Then, depending on the channel gains 
{9k,s}\/( k s ) e /cxS' tne NE p* can be written as follows : 

• Equilibrium 1: when g G Hi, with 

Ux = {g G El : fii > „ m l. and 



^ l+SNRg 22 
^ 1 +SNR Sll }, 



(11) 



S22 

?nen, pi = (p max , 0) arcc? p\ = 
Equilibrium 2: When g G 7?2. 

W 2 = {9 G R| : f 11 ^ 1 +SNR 921 W 



= (0,Pmax) 

with 



g ^ 1 +SNR911 }, 



(12) 



r/ten, = (p ma x, 0) and = (j> max , 0). 
Equilibrium 3: w/iot 5 = (511,512,921,322) S -^3 

{5 G III ' 911 " 1 



H, 



flu 
flia 

321 

922 



1+SNR 9 22 

} 



and 



(13) 



l+SNRgia 
a«d P2 = (0, P. 



then, pi = (0,p m 

Equilibrium 4; w/jen 5 G %4, w/f/j 
{g G Hi 



^4 



^ 1 +SNRgi2 a«d 



flu 

fll2 

^ ^ , , CT 1„ , }, 



(14) 



1- 



922 " 1 + SNRt,! 

then, p\ = (0,p max ) and p* 2 = (p max ,0). 
The proof of Prop. [5] follows immediately from Def. Q] 
and Fig. Q] The sets Hi, ... , %4 are plotted in Fig. |2j in 
order to provide an insight on the different types of equi- 
librium. Note that regardless of the channel realization 5, 
there always exists an NE. Moreover, for certain channel 
realizations, when g G Hi fl H4, both pi 



IP (1) ,P (2) ) 



and 

,(2) 



,(2) .„(!)> 



v P w ,P Vi 'J with = (0,p max ) and 

p w = (pmax, 0), are both NE. In fact, it is shown in 
that at high SNR, it is highly probable that g G Hi (IH4 
and two NE in pure strategies are always observed. 

Now, following the result in (TT), it can be implied 
that when there exist two NE in pure strategies, there 
exists a third NE in mixed strategies. When, there exists a 
unique NE in pure strategies, the NE in mixed strategies 
coincides with the NE in pure strategies. We summarize 
this observation in the following proposition 

Proposition 6 (NE in Mixed Strategies): 
probability measure over the set A k , V/c G /C. Then, 
7r* K ) is an NE in mixed strategies of the 



Let iv* k be a 



7T~ = {TV 

game Q 



i- 



(JC, {Ak} k £ K , { u k} k eic)> $ and onl y V> the 
channel realizations {5fe,s}v(jt s )elCxS sa ^ s fy that g G 
Hi n Ha and, 



..I2.D- 0(2,2) 



0(l,2) + 0(2,l)-0(l,l)-0(2,2) ' 
. 0(l,2)-0(2,2) 



0(1,2)-0(1,1) 



0(1,2) + 0(2,1)- 
. 0(2,1)- 



0(l,l)-0(2,2) ■ 
0(1,1) 



2,1 4,(1, 2) + 0(2,l)-0(l, l)-0(2, 2) ' "2,2 4,(1 ,2) + 0(2 , 1) -0(1 , 1) - 0(2 ,2) ' 

Note that under the assumption g G Hi fl H4, it holds 
that V(fe, s) G /C x 5, 7r^ > 0, and thus, the game ( 
and so Q) possesses two NE in pure strategies and one 
NE in mixed strategies. 



Pi = (Pmaw<5) 
Pb = (PmAx, 0) 




Fig. 2. Nash equilibrium action profiles as a function of the channel 
realization vector g — (<?n, gi2, <?2i, 322) for the two-player-two- 
channel game Q. Here, the function rj) : R+ — > R+ is defined as 
follows: ^(x) = 1 +SNRa:. Note that it has been arbitrarily assumed 
that (721 > fln and gi2 > 322- 



2) Convergence of the FP: In the case the NE is 
unique in the CS game Q, the FP converges to the 
unique NE in pure strategies (Prop. |4]). Nonetheless, 
when several NE simultaneously exist, the FP converges 
to the NE either in pure strategies or mixed strategies. In 
the following, we show a case of convergence in mixed 
strategies using the FP. 

Assume that both players starts the game with the 
initial beliefs //to) = {fj, P w(to), fj, P w(to)), such 
that /j-p(u(to) = 1^: and fj tP m(t ) = j^-, with 
< £j < 1, for all j G K. Hence, based on these beliefs, 

both players coincide choosing the action p^ at t = to- 
Following dHJ, it yields, V/c G /C, and Vn G {1, . . . , 00}, 



4 ip(1) (t +2n-l) 
4, p (2)(*o+2n-l) 

/ fc „Ci) (*o+*») 
/ fcp(2) (io+2n) 





»« fc + (n-I) 












i+e fc 




«+i)s fc +™A 




i+e fc / 


! 


«-i)« fc +" s | 




i+«fc ^ 



(15) 



Here, as long as the following condition holds V/c G /C 
and a given n G {1, . . . , 00}, 

»fa+0-i < *(- (2) - (1) )-H- (2, - (2) ) < "(^ +1 )« fc a6) 

"(^ + 1 )"«* 0(p< 1 ),p( 2 ))-0(p( 1 ),p( 1 )) ^>.(5 fc +i)-S fc ' 

then, the following outcomes are observed, 

p fc (2n - 1) = p« and p fc (2n) = p^ . 

This implies that transmitters will cycle around the 
outcomes (p^\p^) and (p( 2 ),p( 2 )). Note that if 

0( p (2) iP (D )-4,( p m , p m)=4,( p w , p ( 2 > ) -^(p* 1 ) .pW), (17) 

then, the beliefs of each player converge to TTf.,3 = h< 
for all (k, s) G /C x S and players perpetually iterate 
between actions (p^\p^) and (p( 2 ) p(2)^ Here, even 



though 7Tfc 



'1 v 

,2' 2. 



, for all fc G /C, is an NE in mixed 



strategies according to Prop. [6l the achieved expected 
utility can be worse than the worst expected utility at 
NE in pure and mixed strategies. This can be explained 
by the fact that the pure strategies corresponding to 
the NE, i.e., pt = (pt 1 ),^ 2 )) and ptt = (p^pM), 
are never played. Hence, if the channel realizations are 
those such that sharing the same channel is always worse 
than using orthogonal channels, i.e., <fi (p^ 2 \ p^) >> 

cf> (p( 2 W 2 )) and (p«,p( 2 )) » <j> (p (1) ,p (1) ), then, 
a worse utility than the worst NE either in pure or mixed 
strategies is observed. 

Interestingly, if the differences ^{p ->P } ~ 
<Up( 2 \p( 2 )) and 0(pW p< 2 )) - 0(p« p«) are 
sufficiently close, then, a large number n in (fT6l ) 
is required for the FP to quit the cycle mentioned 
above. This implies that a long time is required 
for players to play the four actions profiles and 
thus, obtain the expected utility corresponding 
to the NE in mixed strategies. Here, as long as 

^(p w iP (1, )-*(p (2) iP P) )/*(p (i) >p (!) )-*(p <i) ,p !1) )i there 
always exists an uq < oo, such that Vn > no, condition 
(fTTT ) does not hold, and thus, the cycling effect is not 
longer observed. 

VI. On the Information Assumptions 

In this section, we assume that transmitters do not 
observe the actions taken by all the other transmitters. 
All the knowledge about the other transmitters' actions 
is given by a common message sent by the receiver to 
all the transmitters. Such a message 7(f) £ JR S consists 
on a linear combination of the actions of all transmitters, 
i.e., 7(t) = (71 (t), . . . ,7s(*))> where Vs € S, 

which is simply the multiple access interference at the 

receiver over channel s at time t. 

Let us re-define the utility function as follows Vk : Ak x 

Rf -> R+, 

s 

v k { Pk (t)Mt))=u k (p(t))= ^ lo g'( 1+ ,.(o^X.. )- (18) 

s=l 

The games where the utility function of player k € K, 
can be written as a function of the actions of player k 
and a linear combination of the actions of all the other 
players are known as aggregation games Q. 
Hence, if the receiver is able to broadcast the vector ~f(t) 
and each transmitter can estimate its own channel gains 
9k,i, ■ ■ ■ ,9k,s-> each transmitter k is able to calculate the 
following terms 

Qi.,.(t+i)=Q*,,(t)+ 7 i r («fc(Pfc, ra pce„-r(t))-Q fc ,,(t)). (19) 

Let T : A — > be defined as follows F (p fe ,p_ fe ) = 
(Ti {Pk,P-k) i ■ ■ ■ jTs (p fc ,P_fc)), where, for all s 6 S 

K 

{PkiP-k) = <*l + ^Pk,s9k,s- (20) 
k=i 



From CE2>, it holds that, V(fc, s) € /C x 5, 

= u k (e s , f kiP _ k (t)\ . (21) 

Hence, the myopic response in © is Pfc(t) = pi" fc , 
where, 

n fc (i) G argmaxQ fc s (i), (22) 

and V(A;, s) £ /C x <S at time t > 0, the calculation of 
Qk,s(t) requires only the knowledge of the channel real- 
izations over the respective transmitter and the capability 
of calculating the utility function based on the message 

7(<). 

VII. Conclusions 

In this paper, we have shown that fictitious play (FP) 
is a feasible and simple algorithm to tackle the prob- 
lem channel selection in decentralized multiple access 
networks. It has been shown that FP always converges 
to Nash equilibrium (NE) in the CS game either in 
pure strategies or mixed strategies. Whenever there exist 
several NE in the CS game, the FP might converge to 
mixed strategies and cycles of action profiles might be 
observed. Using a 2 x 2 game, it is shown that such 
cycles might lead to a performance which is worse than 
the worst performance achieved at NE in pure and mixed 
strategies for both players. Finally, we show that the CS 
problem has the structure of an aggregation game, which 
facilitates the implementation of FP requiring only local 
information and minimum feedback. 
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